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1/'^ . Abstract 

f^ . We prove that every connected triangle-free graph on n vertices contains an induced tree on 

rf) ' exp(cyiogn) vertices, where c is a positive constant. The best known upper bound is (2 + o{l))y'7i. 

This partially answers questions of Erdos, Saks, and Sos and of Pultr. 
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^ ; 1 Introduction 

For a graph G, let t(G) denote the maximum number of vertices of an induced subgraph of G that is a 
tree (i.e., connected and acychc). There are arbitrary large graphs G with t{G) < 2, namely graphs in 
which every connected component is a clique. To rule out these trivial examples, we need to put some 

^^ i restrictions on G. 

^~^ • Motivated by study of forbidden configurations in Priestley spaces [T], Pultr (private communication, 

QQ I 2002) asked how big t{G) can be if G is connected and bipartite. Formally, he was interested about 

Tjlj- ■ asymptotic properties of the function 

fsin) = min{i(G') : |V^(G)| — n, G connected and bipartite}. 

f~^ , Pultr's question was the starting point of our work. However, the function t{G) was studied earlier and 

in a more general context by Erdos, Saks, and Sos [2]. They describe the influence of the number of edges 
of G on t{G) and, more to our point, they study how small t{G) can be if w(G) is given. They observe 
that t{G) < 2a{G), and this allows them to use estimates for Ramsey numbers. This way, they show 
that for any fixed fc > 3 there are constants ci, C2 such that 
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^^ logr^ < min{i(G) : |F(G)| - n, G ^ A'^} < calogn. 
log log n 

For fc = 3 the lower bound still applies, but the upper bound obtained by using Ramsey numbers was only 
0{^/n\ogn) (nowadays this approach yields 0{\/nlogn), due to the improved lower bound on i?(fc,3), 
see [4]). We concentrate on this case fc = 3, that is we put 

frin) = min{t(G) : |V^(G)| = n, G connected and triangle- free} . 

Instead of applying Ramsey theory, we approach the problem directly. 
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It is easy to show that frin) < fsin) = 0{y/n). The best construction we are aware of yields 
/b('^) < (2 + o(l))v^; see Section[2j A simple "blow-up" construction, also presented in Section[2l shows 
that if /t(?t-o) < \/no for some uq, then frin) — 0{n^'^^^) for a positive constant e > 0, and similarly 
for Jb- Hence, frin) either is of order exactly y/n, or it is bounded above by some power strictly smaller 
than 1/2. We conjecture that the second possibility holds, and that another power of n is a lower bound. 

Conjecture 1.1 There are constants Q < a < (3 < 1/2, and Ci, Ci such that for all n 

cm'' < frin) < fsin) < C2n^ . 

The following lower bound is the main result of this paper. 

Theorem 1.2 There is a constant c > such that for all n 

/T(n)>e^Vi^. 

We finish the introduction by mentioning further results concerning t{G). It is interesting to consider 
the problem of finding induced trees in (sparse) random graphs. Vega 3_ shows that t(G„ c/n) = ^{n) 
a.s.; Palka and Rucihski [6] prove that i(G'„ ciogn/n) — O(riloglogn/logri) a.s. 

Krishnan and Ochem [5] search for values of frin) (for small n) using a computer; they succeed to 
find frin) for n < 15. They also extend results of |2j about the decision problem: "given a connected 
graph G and an integer t, does G have an induced tree with t vertices?" . Not only this is NP-complete 
for general graphs (which is proved in [2]), but it remains NP-complete even if we restrict to bipartite 
graphs, or to triangle-free graphs of maximum degree 4. 

2 Initial observations 

Observation 2.1 /^(n) < {2 + o{l))^. 

Proof: It is enough to take a path with each edge replaced by a complete bipartite graph. More precisely, 
we take pairwise disjoint sets Vi (for i = —{k — 1), . . . , fc — 1) such that \Vi\ = k — \i\. We let G be the 
graph with vertices V = lj|i|<fc ^ ^^'^ *11 possible edges between Vi and Vi+i (for i = — (fc— 1), . . . , fc — 2). 
It is clear that if an induced tree in G contains a vertex from Vi and two vertices from Vi^i then it 
contains no vertex of Vj for j > i + 1; similarly for i + 1 replaced by i — 1. Therefore any maximum 
induced tree is one of trees Ta^b {—{k — l)<a<b<k— 1 and b — a > 1): it contains all vertices from 
two levels, Va and VJ, and one vertex from each Vi where a < i < b. It is easy to compute that such tree 
contains 2fc — 1 vertices out of the \V\ = fc^; this proves /^(fc^) < 2fc — 1. If (fc — 1)^ < n < k^ then we 
take a subgraph of G to show that fsin) < 2fc — 1 < 2^/n +1. D 

Lemma 2.2 (Blow-up construction) Let G be a connected triangle-free graph and let W C V{G) be 
a subset of m vertices (m > S) such that any induced tree in G contains at most t vertices of W . Then 
we have frin) = 0(n'"(*^i'/'"(™~i)). The same result holds with "triangle- free" replaced by "bipartite" 
and with fx replaced by fs- 

Proof: We let W = {wo, ■ ■ ■ , Wm-i}, and write r — m — 1 and q = t — 1 to simplify expressions. As G 
is triangle- free it follows that i > 3, and so g > 2. 

Let T = Trj be a rooted tree with / levels in which each non-leaf vertex has r sons. Next, for each 
vertex -y of T we take a copy G„ of G (so that distinct copies are disjoint). Whenever w is a non-leaf vertex 
of T and u is its i-th son, we introduce an edge between Wi in Gy and wq in G„; the resulting graph will 
be called T{G) (see Fig. [Ij. Clearly this graph is triangle- free/bipartite if G was triangle- free/bipartite. 
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Moreover, |V^(r(G))| = |V^(r)| • \V{G)\ and |V^(r)| = ^-—^ = Q{r^) (since ^ -^ oo and r > 2) 




Figure 1: Graph T3 2(G) from the proof of Lemma 



Let S be an induced subtree of T{G) and put 

S = {v £ V{T) I Gv contains a vertex of S} . 

By construction, S D G^ is a tree in Gy for each v. So the condition on G imphes that each vertex of S 
has at most t neighbors in S. Consequently, we have (since q >2) 

q' -1 



1^1 < 1 + E(9 + 1)9^"' < 1 + (9 + 1)^ = 0(9') 

-^-^ (7—1 



Now recall that q, r, and |X^(G)| are constants. For a given n, choose the smallest I such that 
n < \V{Tr^i{G))\; we have n = 0(r'). By the above considerations, 

fin) < fiTrAG)) < \ViG)\ ■ e(q') = e(r"°S'-«) = e(n'°S'-'?) , 

which finishes the proof. □ 



Corollary 2.3 ///t("o) < \/no for some uq, then fxin) = 0{n^'^ '^) for a positive constant e > 0. 
(The same is true for fs-) 

Proof: Let G be the graph on uq vertices for which t(G) = t < y^UQ. We let W — V{G) and m = n^ 
and apply Lemma [2121 ^ 



As mentioned in the introduction, Krishnan and Ochcm [S] search for values oi frin) using a computer. 
This was motivated by hope that CoroUarv 12.31 would apply. It turns out, however, that for small n 
Observation 12.11 gives a precise estimate even for frin) (e.g., /t(15) ~ 7); therefore Corollary 12.31 does 
not apply. 

Remark. If we consider the construction from Lemma I^T^ for G = K3, W = V{G), m — 3, and i = 2 we 
recover a result of [5] that there is a graph G containing triangles (but no K4) such that t{G) — O(logn). 



3 Lower bound for bipartite graphs 

Here we prove a statement weaker than Theorem 11.21 — we give a bound on fsin) instead of /^(n). The 
proof is simpler than that of Theorem 11.21 and it serves as an introduction to it. 

We begin with a lemma about selecting induced forests of a particular kind in a bipartite graph. We 
introduce some terminology. Let iJ be a bipartite graph with color classes A and B. We will think of A 



as the "top" class and B as the "bottom" class (in a drawing of G in the plane, say). We write a — \A\ 
and b = \B\. For a subgraph F of i? we write A{F) = V{F) n A, we set a{F) = |A(i^)|, and we define 
B{F) and b{F) similarly. 

Whenever we say forest we actually mean an induced subgraph of H that is a forest. An up- forest 
F is a forest such that every vertex in A{F) has degree (in F) precisely 1 and every vertex in B{F) has 
degree (in F) at least 1. 




Figure 2: An up- forest 

A matching is a forest F in which all vertices have degrees (in F) exactly 1. 

Lemma 3.1 Let H be a bipartite graph with color classes A and B as above, let A be the maximum 
degree of H , and let rj G (0, 1) be a real parameter. Let us suppose that every vertex in A is connected to 
at least one vertex in B. Then at least one of the following cases occurs: 

(M) There is a matching with at least (1 — 77)a edges. 

(B) There is an up-forest F with 

that is 2-branching, meaning that every vertex in B{F) has degree at least 2 in F. 




A 



B 




Figure 3: An illustration of Lemma [57T] 



Proof. Let B' C B he the set of vertices of degree 1 in B. If \B'\ > (1 — ri)a then, clearly, case (M) 
occurs, so we may assume \B'\ < (1 — T])a. Let B" C B consist of all vertices of degree at least 2. Since 
every vertex in A has degree at least 1, \E{H \ N{B'))\ > rja, and so \B"\ > {tj/ /S)a. 

Let us set Bq = B" and let Fq be an empty graph. Supposing that a set Bi-i C B" and an up-forest 
Fi-i have already been constructed with Bi-i ^ 0, we construct Bi and Fi. We let Vi be an arbitrary 
vertex in i?i_i, and we let Si be the star formed by Vi and all of its neighbors in A. We set Fi — Fi^iUSi, 
we let Ni C B he the neighborhood of A{Si), and we let Bi he i?i_i \ Ni. The construction finishes when 
Bi =%, with Fi as the resulting up- forest. 

It is easy to check that this construction indeed yields an up- forest F with each degree in B{F) at least 
2. We have a(S'i) < A and \Ni\ < a(S'i)(A- l)-hl, and so in each step, at most \Ni\ < A(A-l)-hl < A^ 
vertices are removed from Bi. Having started with at least {ri/A)a vertices, we can proceed for at least 
{r]/A^)a steps, and so the resulting up-forest is as in (B). D 



Now we prove the lower bound 



fsin) > e' 



for a constant c > 0. 

Let G be a given connected bipartite graph. We assume that n — \V{G)\ is sufficiently large whenever 
convenient. We let t be the "target size" of an induced tree in G we are looking for; namely, t = 
\exp{cy/\ogn)~\ . If G has a vertex of degree at least t — 1, then we can take its star for the induced tree 
and we are done, so we may assume that the maximum degree satisfies A < t — 2. 

Let us fix an arbitrary vertex of G as a root, and let Li be the set of vertices of G at distance precisely 
i from the root. All edges of G go between L^-i and L, for some i, since an edge within some Li would 
close an odd cycle. 

We may assume that Lt — 9, for otherwise G contains an induced path of length t. Hence there is a 
k with \Lk\ > n/t. 

Let us fix such a k. We are going to construct sets Mi C Li, i = fc, /e — 1, . . ., inductively, until we 
first reach an i with \Mi\ — 1 (this happens for j = at the latest since \Lo\ — 1). We shall let £ be this 
last i. 

Suppose that nonempty sets Mk, M^-i, . . . , Mi have already been constructed, in such a way that the 
subgraph of G induced by Mk U • • • U Mi is a forest, each of whose components intersects Mi in at most 
one vertex. We are going to construct Afi_i. 

Let us put A = Mi, B = Li_i, and let us consider the bipartite graph H induced by ^ U S in G. 
Every vertex of A is connected to at least one vertex in B. We set rj = j and apply Lemma [3. II This 
yields an up-forest i^ in iJ as in the lemma. We define Mi_i = B{F). 

If i^ is a matching, i.e., case (M) occurred in the lemma, we call the step from Mi to Mi_i a matching 
step. In this case, we have |Mj_i| > (1 — i)|Mi|. Otherwise, F is a 2-branching forest; then we call the 
step a branching step and we have |Mi_i| > \Mi\/{tA?) > \Mi\/t^. 

Suppose that the sets Mk, ■ ■ ■ ,Mi have been constructed, \AIg\ — 1. We claim that the number b of 
branching steps in the construction is at least ci^/Togn for a suitable constant ci > 0. Indeed, there are 
no more than t matching steps, and so 1 = \Mi\ > |Mfe|(l - 1/^)*^"'*'' > {n/t)e~'^/2 ■ t-^^ = n{nt-^^-^). 
Thus b — 0(logn/logt) = 0(-\/Iogn), since t = [exp(c-\/Iogn)] . 

It is easy to see that Mk U Mk-i U • • • U M^ induces a forest in G. We let T be the component of this 
forest containing the single vertex of Mi. Since every vertex of Mi_i, £ < i < k, has at least one neighbor 
in Mi, and if the step from Mi to Mi_i was a branching step then each vertex of Mi_i has at least two 
neighbors in Mi, it follows that T has at least 2^ = exp(il(-yiogn)) vertices. This finishes the proof of 
the lower bound fsin) > exp(c\/logn ). □ 

Remark. The above proof may seem wasteful in many respects. However, the result is tight up to the 
value of the constant in the exponent if we insist on selecting an induced tree "growing up" (i.e., made of 
up-forests for some choice of root and corresponding sets Li). Indeed, any such induced tree in the graph 
Gr in Figure |4] may contain at most two of the r vertices at the topmost level of the graph. Let us put 
r = exp(c%/Togn ) and glue copies of Gr according to the pattern of a complete r-ary tree (as in the proof 
of Lemma l2.2|) . so that the resulting graph has approximately n vertices (that is, the depth is I = 8(logn). 
We obtain a graph with all up-growing induced trees having size at most 2' — exp(0(^logn)). 




Figure 4: Graph Gg in which all "up-growing trees" contain at most two vertices of the uppermost level. 



4 Lower bound for triangle-free graphs 



Here we prove Theorem 11.21 The scheme of the proof is very similar to the proof of the same bound for 
bipartite graphs in Section [31 We continue using the definitions and notation from that proof. So we 
decompose the given graph into the levels Lq, Li, . . . , Lr, r < t. The main difference compared to the 
bipartite case is that there may now be edges within the levels L,;. We will need the well-known fact that 
any graph on n vertices with maximum degree A contains an independent set of size at least f^/(A + 1). 
We will also need the following simple modification. 

Lemma 4.1 Let T he a graph (not necessarily bipartite) on n vertices with maximum degree A, and let 
r] G [0, 1] be a real parameter. Then at least one of the following two cases occurs: 

(IS) r contains an independent set with at least (1 — 77)71, vertices. 

(IM) r contains an induced matching with at least ^n edges. 

Proof. We repeatedly select edges ei, 62, . . . of F; having selected a, we delete it and all the neighbors of 
its endvertices from the current graph. In each step we delete at most 2A vertices, so we either construct 
an induced matching as in (IM) or reach an edgeless graph after deleting at most rjn vertices, hence 
yielding an induced set as in (IS). □ 

Proof of Theorem 11.21 We proceed similarly as in the previous section. We suppose G is a given 
triangle- free graph on n vertices (and that n is big enough), we put t = [exp(c-v/Iogn )] . Again, we may 
assume i < A — 2: G is triangle- free, so a star of a vertex is an induced tree. 

As before, we begin by selecting a root vertex and constructing the at most t levels Lq, -^1, ■ • ■ ■ We 
select k such that \Lk\ > n/t and we will construct sets Mk, Mk-i, ■ ■ ■ , Me, such that Mi C Li, \Mi\ — 1, 
in such a way that their union induces a forest in G. In the induction step, we will either construct Mi_i 
from Mi, or sometimes we will go down two levels at once, producing both M,;_i and Mi-2- 

We begin by selecting M^ as an independent set in the subgraph induced by Lfe. By the fact mentioned 
before Lemma |4T] we may assume \Mk\ > \Lk\/t> n/t^. 

We suppose that Mi has already been constructed so that each component of the forest induced by 
Mk U • • • U Mi intersects Li in at most one vertex (and, in particular. Mi is an independent set). Now 
we proceed as in the proof in Section [3l We let A = Mi, B = Li_i, and we consider the bipartite graph 
H induced by A U i? in G. We apply Lemma [3. II to H with r/ — j, obtaining an up- forest F. We set 
M-_-^ = B{F); this is not yet the final Mi since there may be edges on M[_-y. 

If case (B) occurred in Lemma [Ol we have |Mj'_]^| > |Mi|/i^. We let Mi_i be an independent set of 
size |Mj'|/(A -I- 1) > \Mi\/t^ in the subgraph induced by M'i_i. We call this step a branching step. 

If case (M) occurred in Lemma [3Tl we have |M/_;^| > (1 — i)|Mi|. Then we apply Lemma WJ\ with 
77 = I to the graph F induced in G by M[_t^. If case (IS) applies in that lemma, we let Mi_i be the 
independent set of size at least (1 — y)|Mj'_]^| > (1 — i)^|Mi|; we call this step a matching step. Both the 
matching step and the branching step go one level down, from i to i — 1. 

If case (IM) applies in Lemma ITTl we define Mi-i as the vertex set of the induced matching from the 
lemma. In this case we have |-Mi_i| > (1 — \)\Mi\lt^ . Note that this Mi_i does not satisfy the inductive 
assumption (it is not an independent set). We are also going to construct Mi_2 in the same step, thus 
going from i to i — 2. To obtain Mi_2, we define another auxiliary bipartite graph, which we again call 
H to save letters. The bottom color class B is Li_2, and the top color class A is obtained by contracting 
the edges induced by Mi_i. More formally, we set A — {uu' G E{G) : u,u' G Mi-2}, B — Li_2, and 
E{H) — {^{uu' ,v} : u,u' & A,v € B, uv G E{G) or u'v € E{G)]. (Note that in this definition it can not 
happen that both uv and u'v are edges of G, as G is triangle- free.) We apply Lemma [3T] with rj — ^, 
say, to H. In both of cases (M) and (B) we obtain an up-forest F in H with b{F) > |Mi_i|/(32t^) (we 
note that \A\ = i|Mi_i| and that H has maximum degree no larger than 2t). We set M/_2 = B{F), 
and finally, we select Mi_2 as an independent set of size at least |M/_2l/^ in the subgraph induced by 
M'i_2. Since G is triangle- free, one can check that Mk U • • • U Mi-i U Mi-2 induces a forest. We have 
|M,_2| > |Mi_i|/32t4 > |M,| • (1 - \)/?,2t^ > \Mi\/t'^. We call this step from M, to Mi-2 a double-step. 



By calculation similar to that in Section [31 we find that the number b of branching steps and double- 
steps together is at least rt{^/\ogn). We again claim that the component of the forest induced by 
Mk U • • • U Ml containing the single vertex of Mi has at least 2^ vertices. Indeed, if Ali was obtained 
from Afi+i by a branching step, then each vertex of Mi has at least two successors in M^+i. If Mi was 
obtained from Mi+2 by a double-step, then each vertex v of Mi has at least one succesor in M^+i, this is 
connected by an edge to precisely one other vertex of M^+i, and both of these vertices have one neighbor 
in Mi+2; consequently v has at least two successors in Mi^2- Theorem II .21 is proved. □ 
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